SEQUENCE LISTING



<110> Nilles, Matthew L. 
Matson, Jyl S.

<120> YERSINIA SPECIES COMPOSITIONS 

<130> 3128-6046US 

<140> 10622220 

<141> 2003-07-18 

<150> US 60/444,076 

<151> 2003-01-31 

<160> 20 

<170> PatentIn version 3.2

<210> 1 

<211> 87 

<212> PRT 

<213> Yersinia pestis 

<220> 

<223> amino acid sequence of YscF 
<220> 

<223> sequence can be found at MedLine accession number NP_857921.1 

<400> 1 

Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp Ile Ala Asp Leu
1 5 10 15

Asp Ala Val Ala Gln Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys
20 25 30

Ala Val Asn Asp Ser Ile Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro
35 40 45

Ala Leu Leu Ala Asp Leu Gln His Ser Ile Asn Lys Trp Ser Val Ile
50 55 60

Tyr Asn Ile Asn Ser Thr Ile Val Arg Ser Met Lys Asp Leu Met Gln
65 70 75 80

Gly Ile Leu Gln Lys Phe Pro
85



<210> 2 
<211> 264 



<212> DNA 

<213> Yersinia pestis 
<220> 

<223> nucleic acid encoding YscF
<220> 

<223> sequence can be found at MedLine accession number NC_004839
<400> 2 

atgagtaact tctctggatt tacgaaagga accgatatcg cagacttaga tgcggtggct 60 
caaacgctca agaagccagc agacgatgca aacaaagcgg ttaatgactc gatagcagca 120 
ttgaaagata agcctgacaa cccggcgcta cttgctgact tacaacattc aattaataaa 180
tggtcggtaa tttacaatat aaactcaacc atagttcgta gcatgaaaga cttaatgcaa 240
ggcatcctac agaagttccc ataa 264 

<210> 3 
<211> 264 
<212> DNA 

<213> Yersinia enterocolitica
<220>

<223> nucleic acid encoding YscF from Y. enterocolitica

<220>

<223> sequence can be found at MedLine accession number NC_005017

<400> 3 

atgagtaatt tctctgggtt tgcaaaagga accgatatca cagacttaga tgcggtggct 60 
caaacgctca agaagccagc agacggcgca aacaaagcgg ttaatgactc gatagcagcg 120
ttgaaagaga cgcctgacaa cccggcgcta cttgctgact tacaacattc aattaataaa 180
tggtcggtaa tttacaatat aagctcaacc atagttcgta gcatgaaaga cttaatgcaa 240 
ggcatcctac agaagttccc ataa 264 



<210> 4 

<211> 87 

<212> PRT 

<213> Yersinia enterocolitica

<220>

<223> amino acid sequence of YscF from Y. enterocolitica
<220>

<223> sequence can be found at MedLine accession number NP_863538.1

<400> 4



Met Ser Asn Phe Ser Gly Phe Ala Lys Gly Thr Asp Ile Thr Asp Leu
1 5 10 15

Asp Ala Val Ala Gln Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys
20 25 30

Ala Val Asn Asp Ser Ile Ala Ala Leu Lys Asp Thr Pro Asp Asn Pro
35 40 45

Ala Leu Leu Ala Asp Leu Gln His Ser Ile Asn Lys Trp Ser Val Ile
50 55 60

Tyr Asn Met Ser Ser Thr Ile Val Arg Ser Met Lys Asp Leu Met Gln
65 70 75 80

Gly Ile Leu Gln Lys Phe Pro
85



<210> 5 
<211> 80 
<212> PRT 

<213> Salmonella enterica 
<220> 

<223> amino acid sequence of PrgI 
<220> 

<223> sequence can be found at MedLine accession number CAD05980.1 
<400> 5 

Met Pro Thr Ser Trp Ser Gly Tyr Leu Asp Glu Val Ser Ala Lys Phe
1 5 10 15

Asp Lys Gly Val Asp Asn Leu Gln Thr Gln Val Thr Glu Ala Leu Asp
20 25 30

Lys Leu Ala Ala Lys Pro Ser Asp Pro Ala Leu Leu Ala Ala Tyr Gln
35 40 45

Ser Lys Leu Ser Glu Tyr Asn Leu Tyr Arg Asn Ala Gln Ser Asn Thr
50 55 60

Val Lys Val Phe Lys Asp Ile Asp Ala Ala Ile Ile Gln Asn Phe Arg
65 70 75 80



<210> 6 
<211> 243 
<212> DNA 

<213> Salmonella enterica 
<220> 

<223> sequence can be found at MedLine accession number AL627276 
<220> 

<223> nucleic acid encoding PrgI 

<400> 6 

atgccaacat cttggtcagg ctatctggat gaagtttcag caaaatttga taagggcgtt 60 
gataatctac aaacgcaggt aacagaggcg ctggataaat tagcagcaaa accctccgat 120
ccggcgctac tggcggcgta tcagagtaag ctctcggaat ataacttgta ccgtaacgcg 180
caatcgaaca cggtaaaagt ctttaaggat attgatgctg ccattattca gaacttccgt 240 
taa 243 



<210> 7 
<211> 83 
<212> PRT 

<213> Shigella flexneri 

<220> 

<223> amino acid sequence of MxiH 

<220> 

<223> sequence can be found at MedLine accession number NP_858270.1 
<400> 7 

Met Ser Val Thr Val Pro Asn Asp Asp Trp Thr Leu Ser Ser Leu Ser
1 5 10 15

Glu Thr Phe Asp Asp Gly Thr Gln Thr Leu Gln Gly Glu Leu Thr Leu
20 25 30

Ala Leu Asp Lys Leu Ala Lys Asn Pro Ser Asn Pro Gln Leu Leu Ala
35 40 45

Glu Tyr Gln Ser Lys Leu Ser Glu Tyr Thr Leu Tyr Arg Asn Ala Gln
50 55 60

Ser Asn Thr Val Lys Val Ile Lys Asp Val Asp Ala Ala Ile Ile Gln
65 70 75 80

Asn Phe Arg



<210> 8
<211> 252
<212> DNA

<213> Shigella flexneri
<220>

<223> nucleic acid encoding MxiH

<220>

<223> sequence can be found at MedLine accession number NC_004851

<400> 8

atgagtgtta cagtaccgaa tgatgattgg acattgagtt cattatctga aacttttgat 60
gatggaactc aaacattaca aggtgaacta acattggcac tagataaatt agctaaaaat 120
ccttcgaatc cacagttgct ggctgaatac caaagtaaat tatctgaata tacattatat 180
aggaacgcgc aatccaatac agtgaaagtg attaaggatg ttgatgctgc aattattcaa 240
aacttcagat aa 252

<210> 9

<211> 73
<212> PRT

<213> Escherichia coli

<220>

<223> amino acid sequence of EscF
<220>

<223> sequence can be found at MedLine accession number NP_312579.1
<400> 9

Met Asn Leu Ser Glu Ile Thr Gln Gln Met Gly Glu Val Gly Lys Thr
1 5 10 15

Leu Ser Asp Ser Val Pro Glu Leu Leu Asn Ser Thr Asp Leu Val Asn
20 25 30

Asp Pro Glu Lys Met Leu Glu Leu Gln Phe Ala Val Gln Gln Tyr Ser
35 40 45

Ala Tyr Val Asn Val Glu Ser Gly Met Leu Lys Thr Ile Lys Asp Leu
50 55 60

Val Ser Thr Ile Ser Asn Arg Ser Phe
65 70



<210> 10 

<211> 222 
<212> DNA 

<213> Escherichia coli 

<220> 

<223> nucleic acid encoding EscF 

<220> 

<223> sequence can be found at MedLine accession number NC_002695

<400> 10 

atgaatttat ctgaaattac tcaacaaatg ggtgaagtag gtaaaacgct gagcgattct 60 
gtgccagagt tacttaatag caccgatttg gttaatgacc ctgaaaaaat gttagagttg 120
cagtttgcgg ttcagcaata ttctgcttat gttaacgtag aaagtggaat gttgaaaacg 180 
ataaaagatc tggtctcaac catttctaac cgtagttttt aa 222 



<210> 11 

<211> 240 

<212> DNA 

<213> Yersinia pestis 
<220> 

<223> nucleic acid encoding HT-YscF; an example of a recombinant YscF 
<220> 

<221> misc_feature 

<222> (214)..(237)

<223> His-tag sequence 

<400> 11 

atgagtaact tctctggatt tacgaaagga accgatgatg caaacaaagc ggttaatgac 60 

tcgatagcag cattgaaaga taagcctgac aacccggcgc tacttgctga cttacaacat 120

tcaattaata aatggtcggt aatttacaat ataaactcaa ccatagttcg tagcatgaaa 180

gacttaatgc aaggcatcct acagaagttc ccactcgagc accaccacca ccaccactga 240



<210> 12 

<211> 95 

<212> PRT 

<213> Yersinia pestis 
<220> 

<223> amino acid sequence of HT-YscF; an example of a recombinant YscF 

<220> 



<221> misc_feature
<222> (88)..(95)
<223> His-tag



<400> 12 

Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp Ile Ala Asp Leu
1 5 10 15

Asp Ala Val Ala Gln Thr Leu Lys Lys Pro Ala Asp Asp Ala Asn Lys
20 25 30

Ala Val Asn Asp Ser Ile Ala Ala Leu Lys Asp Lys Pro Asp Asn Pro
35 40 45

Ala Leu Leu Ala Asp Leu Gln His Ser Ile Asn Lys Trp Ser Val Ile
50 55 60

Tyr Asn Ile Asn Ser Thr Ile Val Arg Ser Met Lys Asp Leu Met Gln
65 70 75 80

Gly Ile Leu Gln Lys Phe Pro Leu Glu His His His His His His
85 90 95



<210> 13 

<211> 5537 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> pMJ119 

<220> 

<221> misc_feature

<222> (563)..(1375)

<223> kan encoded on complementary strand
<220>

<221> misc_feature

<222> (3518)..(4597)

<223> LacI encoded on complementary strand

<220>

<221> promoter

<222> (4983)..(5000)

<223> T7 promoter

<220>

<221> misc_feature

<222> (5001)..(5001)

<223> T7 transcription start

<220>

<221> misc_feature

<222> (5072)..(5104)

<223> T7 Tag

<220>

<221> CDS

<222> (5114)..(5401)

<223> HT-YscF

<220>

<221> misc_feature

<222> (5381)..(5398)

<223> His-tag sequence

<220>

<221> terminator

<222> (5466)..(5512)

<223> T7 terminator 

<400> 13 

tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60 

cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 

ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180 

gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240

acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300

ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 

ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 

acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480

tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540

tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600

tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 

actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720

gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 

aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840

agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900

cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 

aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020

tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 



tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140

taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 

ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 

tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 

tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 

cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 

cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500

gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 

gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620

agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680

aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 

agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800

cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860

accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920

aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980

ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040

cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100

gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 

tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 

agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 

tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 

caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 

ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 

gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520

gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 

gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640

aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 

ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760 



acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820

ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880

tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940

tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000

cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060

gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120

ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 

catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240

ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300

gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 

gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420

ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480

atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540

cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600

tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660

ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720

aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780 

atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840

cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 

gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 

tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020

agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080

gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140

ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 

catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 

tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320 

tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380

gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440

ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 



tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560

catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620

cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680

tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740

ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800

ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860 

cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920

gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980

aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040

ttttgtttaa ctttaagaag gagatataca tatggctagc atgactggtg gacagcaaat 5100 

gggtcgggat ccg atg agt aac ttc tct gga ttt acg aaa gga acc gat 5149
Met Ser Asn Phe Ser Gly Phe Thr Lys Gly Thr Asp
1 5 10

atc gca gac tta gat gcg gtg gct caa acg ctc aag aag cca gca gac 5197
Ile Ala Asp Leu Asp Ala Val Ala Gln Thr Leu Lys Lys Pro Ala Asp
15 20 25

gat gca aac aaa gcg gtt aat gac tcg ata gca gca ttg aaa gat aag 5245
Asp Ala Asn Lys Ala Val Asn Asp Ser Ile Ala Ala Leu Lys Asp Lys
30 35 40

cct gac aac ccg gcg cta ctt gct gac tta caa cat tca att aat aaa 5293
Pro Asp Asn Pro Ala Leu Leu Ala Asp Leu Gln His Ser Ile Asn Lys
45 50 55 60

tgg tcg gta att tac aat ata aac tca acc ata gtt cgt agc atg aaa 5341
Trp Ser Val Ile Tyr Asn Ile Asn Ser Thr Ile Val Arg Ser Met Lys
65 70 75

gac tta atg caa ggc atc cta cag aag ttc cca ctc gag cac cac cac 5389
Asp Leu Met Gln Gly Ile Leu Gln Lys Phe Pro Leu Glu His His His



